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ABSTRACT 

An (upward) skip-free Markov chain with the set of nonnegative integers as state 
space is a chain for which upward jumps may be only of unit size; there is no restriction 
on downward jumps. In a 1987 paper, Brown and Shao determined, for an irreducible 
continuous-time skip-free chain and any d, the passage time distribution from state 
to state d. When the nonzero eigenvalues fj of the generator on {0,... ,d}, with d 
made absorbing, are all real, their result states that the passage time is distributed as 
the sum of d independent exponential random variables with rates Uj. We give another 
proof of their theorem. In the case of birth-and-death chains, our proof leads to an 
explicit representation of the passage time as a sum of independent exponential random 
variables. Diaconis and Miclo recently obtained the first such representation, but our 
construction is much simpler. 

We obtain similar (and new) results for a fastest strong stationary time T of an er- 
godic continuous-time skip- free chain with stochastically monotone time-reversal started 
in state 0, and we also obtain discrete-time analogs of all our results. 

In the paper's final section we present extensions of our results to more general 
chains. 
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1 Introduction and summary 

An (upward) skip-free Markov chain on the set of nonnegative integers is a chain for 
which upward jumps may be only of unit size; there is no restriction on downward jumps. 
Brown and Shao pQ determined, for an irreducible continuous-time skip-free chain and 
any d, the passage time distribution from state to state d (we have equivalently re- 
identified the exponential rates): 

Theorem 1.1 ([I]). Consider an irreducible continuous-time skip-free chain Y on the 
nonnegative integers with Y(0) = 0. Given d, let X (with state space {0, ...,d}) be 
obtained from Y by making d an absorbing state, and let G denote the generator for X . 
Then the hitting time of state d (same for X and Y) has Laplace transform 



where Uq, . . . , Ud-i are the d nonzero eigenvalues of —G (known to have positive real 
parts). In particular, if the Uj 's are real, then the hitting time distribution is the convo- 
lution of Exponential^) distributions. 

In this paper we give a new proof for Theorem 11.11 In the case of birth-and-death 
chains (which are time-reversible and thus have real eigenvalues), the result is often 
attributed to Keilson but in fact (consult [3]) dates back at least to Karlin and Mc- 
Gregor [9] . Our proof leads in this case to an explicit (sample-path) representation of 
the passage time as a sum of independent exponential random variables. Diaconis and 
Miclo [3] recently obtained the first such representation for birth-and-death chains, but 
our construction is much simpler. 

There is an analog for discrete time: 

Theorem 1.2. Consider an irreducible discrete-time skip-free chain Y on the nonneg- 
ative integers with Y(0) = 0. Given d, let X (with state space {0, . . . ,d}) be obtained 
from Y by making d an absorbing state, and let P denote the transition matrix for X. 
Then the hitting time of state d (same for X andY) has probability generating function 



where 0o, ■ ■ ■ ,9d-i are the d non-unit eigenvalues of P. In particular, if every 9j is real 
and nonnegative, then the hitting time distribution is the convolution of Geometric (1 — 
9j) distributions. 

Remark 1.3. The conclusions of Theorems 1 1 . 1 1 and 1 1 . 2 1 hold for all skip- free chains X 
on {0, ... , d} for which cj^j+i > (respectively, Pi^+i > 0) for i = 0, . . . , d — 1. Indeed, 
our proofs extend to that case, or one can extend the theorems by a simple perturbation 
argument. 



d-l 
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Results similar to Theorems 1 1 . 11JL21 can also be established for fastest strong station- 
ary times (consult [2], [3], and [5] for general background, and (3.3) and (3.5) of [5] for 
how to check from the generator G of a continuous-time chain whether the chain has the 
"monotone likelihood ratio" property, that is, whether its time-reversal is stochastically 
monotone) : 

Theorem 1.4. 

(a) Consider an ergodic (equivalently, irreducible) continuous-time skip-free chain X 
on the state space {0, . . . ,d} with X(0) = and stochastically monotone time-reversal. 
Let G denote the generator for X. Then a fastest strong stationary time (SST) for X 
has Laplace transform 



where i>q, . . . , \ are the d nonzero eigenvalues of —G (known to have positive real 
parts). In particular, if the Vj 's are real, then the fastest SST distribution is the convo- 
lution of Exponential^ ) distributions. 

(b) Consider an ergodic (equivalently, irreducible and aperiodic) discrete-time skip-free 
chain X on the state space {0, . . . , d} with X(0) = and stochastically monotone time- 
reversal. Let P denote the transition matrix for X. Then a fastest strong stationary 
time (SST) for X has probability generating function 



where 9o, ■ ■ ■ ,0d-i are the d non-unit eigenvalues of P. In particular, if every 6j is real 
and nonnegative, then the fastest SST distribution is the convolution of Geometric(l — 9j) 
distributions. 

Our proofs of Theorems I1.1H1.2I and 11.41 could hardly be simpler in concept. Take 
Theorem 11.21 f° r instance. Let 9o, ... ,9^-1 be ordered arbitrarily. We will exhibit a 
matrix A, with rows and columns indexed by the state space {0, ...,d}, having the 
following properties: 

(i) A is lower triangular. 

(ii) The rows of A sum to unity, 
(hi) AP = PA, where P is defined by 





Uj + u 




3=0 




Pij 



9i if j = i 
1-9, if j = z + 1 
otherwise, 




for i,j = 0, . . . , d (with 9^ = 1). 
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We will prove that from these three properties follows 



P(T < t) = P*(0,d), t = 0,l ) ... 



(1.2) 



where T is the absorption time in question. [In fact, all we will need to prove (|1.2|) 
are (hi) and the following two easy consequences of (i)-(ii): 



In the special case that the 9j's are real and nonnegative, then Theorem 11.21 follows 
immediately from (jl.2)) : the general case will not take too much additional work. More- 
over, for birth-and-death chains with nonnegative eigenvalues the proof will reveal how 
to identify (i.e., construct) geometric random variables summing to T. 

In Section 1 of [6] it was shown how that paper's main continuous-time theorem 
(Theorem 1.1) follows immediately from the main discrete-time theorem (Theorem 1.2). 
By the same one- line proof, in the present paper Theorem 11.11 follows immediately from 
Theorem ll.21 For this reason we choose not to present a direct proof of Theorem [T7T], but 
in the case of birth-and-death chains we will give the explicit construction of independent 
exponential random variables summing to the absorption time. For the same reason we 
will prove only part (b) of Theorem 11.41 

Here is an outline for the paper. In Section [2] we prove Theorem ll.21 and in Section [3] 
we give stochastic constructions for Theorems I1.1H1.2I in the special case of birth-and- 
death chains. Section 2] is devoted to a proof of Theorem ll.4f b). In Section [5] we present 
extensions of our results, including stochastic constructions, to more general chains. 



In this section we prove Theorem 11.21 following the outline near the end of Section [TJ 
Denote the eigenvalues of P, with each distinct eigenvalue listed as many times as its 
algebraic multiplicity, by Oq, . . . , 9^. We claim that precisely one of these, say 6>^, equals 1 
and that the rest have modulus smaller than 1; how the rest are ordered will for the 
majority of our results be irrelevant (we shall take special notice otherwise). 

Here is a proof of the simple claim. Let P' denote the leading principal d-by-d 
submatrix of P. Expanding the determinant of P — 91 on the last row, we find that 
the d + 1 eigenvalues of P are 9 = 1 together with the d eigenvalues of P'. But by our 
assumptions that Pi^+i > for t = 0, . . . , d — 1 and d is absorbing, we have (P'Y — > 
and so (see for example [8j Theorem 5.6.12]) p(P') < 1. 

Let / denote the identity matrix and define 



Q k : = (l-flo)" 1 •••(!- 9 k _ 1 )- 1 (P-6 I)---{P-e k _ 1 I), k = 0,...,d, (2.1) 



(iv) A(0,A;) =S , k . 



(v) A(k, d) = if k + d.] 



2 Proof of Theorem [C2 



with the natural convention Qq := 1. Note that 



Q k P = OkQk + (i - 0k)Qk+i, k = o,...,d-i 



(2.2) 



and that the rows of each Q k all sum to 1. 
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Define 

A(i,j) :=Qi(0,j), i,j = 0,...,d. (2.3) 

Lemma 2.1. Let P be as in Theorem \1.SX Then the matrix A defined at (|2.3[) enjoys 
properties (i)-(iii) listed in Section^ 

Proof. The rows of A sum to unity because each of the basic factors (1 — 9 r )~ l {P — 6 r I) 
in (|2.1|) has that property; and since P is skip-free, A is lower triangular. Our next 
claim is that AP = PA, where P is defined at (jl.ip . Indeed, equality of fcth rows is 
clear from (|2.2|) for k = 0, . . . , d — 1 and from the Cay ley-Hamilton theorem for k = d. 

Remark 2.2. It is not surprising that the eigenvalues of P line the diagonal of the 
upper triangular matrix P, because our A is nonsingular and so property (iii) implies 
that P and P are similar. To see that the lower triangular matrix A is nonsingular, 
we check that its diagonal entries are all nonzero. Indeed, for k = 0, . . . , d, we use the 
skip-free property of P to calculate: 

A(M)=g>(o.*)= (1 _y ) ;;;^_ i) ^a 



So we have found a matrix A satisfying properties (i)-(iii), and properties (iv)-(v) 
follow easily from (i)-(ii). Notice that property (iii) immediately extends to 

AP* = P*A, * = 0, 1, (2.4) 

Lemma 2.3. Let P and X , with absorption time T, be as in Theorem ] and let P be 
the bidiagonal matrix defined at (jl.ip . Then (j 1 . 2 j) holds: 

p(r < t) = p*(o, d), t = o,i,.... 

Proof. Let A be any matrix enjoying properties (iii)-(v) of Section [TJ recall that Lemma 
12.11 provides such a A. We will use A to establish the desired result, and the conse- 
quence (I2.4p of property (iii) will be key. 

We consider the (0, d) entry of each side of (|2.4p . On the left we have 

(AP*)(0,d) =^A(0,£)P*(M) = P\0,d) = P(T < t), 
k 

where the second equality is immediate from property (iv). On the right we have 
(P*A)(0, d) = P*(0, k)A(k, d) = P*(0, d)A(d, d), 

k 

where the second equality is immediate from property (v). So all that remains is to 
establish that A(d,d) = 1. 

We do this by passing to the limit as t — > oo in our now-established equation 

P(T <t) = P\0,d)A{d,d). 
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The limit on the left is, of course, 1. Moreover, we claim that P t (Q,d) — > 1, and then 
the proof of the lemma will be complete. The claim is probabilistically obvious when 
the eigenvalues are all real and nonnegative. To see the claim in general, write P as the 
bordered matrix 

" A b 
1 

by breaking off the last row and column. The matrix A has spectral radius p(A) < 1; 
and because the rows of P sum to unity, we have b = (I — A)l, where 1 denotes the 
vector of d ones. In this notation we find 



P 1 



A 1 (I + A + --- + A t ~ 1 )b 
1 



{I-A)- l b 
1 



which is a matrix with Is throughout the last column and Os elsewhere. In particular, 
F*(0,d)->1. □ 

As mentioned in Section [H Lemma 12.31 is clearly all that is needed to prove Theo- 
rem [L2] when the eigenvalues of P are nonnegative real numbers. In general we proceed 
as follows. From Lemma 12.31 it follows that T has probability generating function 



Eu 1 



(l-it)(/-itP) _1 (0,d), \u\ < 1. 



(2.5) 



But the simple form of P makes it an easy matter to invert 1 — uP explicitly: the inverse 
is upper triangular, with 



(I-uP)-\i, 
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(1 - 0$ ■ ■ ■ (1 - Bj- X )-w 



0<i<j<d. 



(1 - Oiu) ■■■(!- 6 jU ) ' 
Taking i = and j = d and combining (|2.5p - (|2.6p gives Theorem ll.2i 



(2.6) 



3 Stochastic constructions for Theorems 11.11 and 11.21 

In this section we consider Theorems 11.11 and 11.21 again in the case of a birth-and- 
death chain. For such a chain, the eigenvalues Uj or 9j are all real; this can be seen 
by perturbing (by an arbitrarily small amount) to get an ergodic generator or kernel, 
which is time-reversible and thus diagonally similar to a symmetric matrix. We show 
how the proof in Section [2] of Theorem 11.21 yields an explicit construction of independent 
geometric random variables summing to the hitting time in the case of a birth-and- 
death chain having nonnegative eigenvalues (for which it is sufficient that the holding 
probabilities satisfy pa > 1/2 for all i). We also exhibit an analogous representation of 
the hitting time in Theorem ll.il as a sum of independent exponential random variables 
in the case of a continuous-time birth-and-death chain. 

We begin with the discrete case. Returning to (12. II) . we now suppose that P is 
a discrete-time birth-and-death kernel with nonnegative eigenvalues Oj ordered so that 
< O < • • • < fd-i <0 d = l. The polynomials (P-6 I) ■ ■ ■ (P-^-iJ) in P appearing 
at ()2.ip are called spectral polynomials. We claim that the spectral polynomials are 
all nonnegative matrices, that is, that the matrices Qk at (|2.ip are stochastic; this 
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follows immediately by perturbation from Lemma 4.1 in [6] (which concerns ergodic 
birth-and-death kernels and is in turn an immediate consequence of |10t Theorem 3.2]. 
Reversibility of P plays a key role.) Thus the matrix A defined at (|2.3p is stochastic, 
as of course is the pure-birth kernel P defined at (jl.ip . For stochastic P, P, and A, 
the identity AP l = P t K at fl2.4|) is read as "the semigroups (P t )t>o and (P*)t>o are 
intertwined by the link A" . 

Whenever we have such an intertwining and (to be specific) Xq = 0, Section 2.4 of [2] 
shows one way to construct explicitly, from X and independent randomness, another 
Markov chain, say X , with kernel P such that 

C(X t \X ,...,X t ) = A(X t ,-) foralH. (3.1) 

In particular, since the link A is lower triangular [by Lemma I2.1( i)] and A(d, d) = 1 
(recall the proof of Lemma 12. 3p . so that A(d, •) is unit mass at d, it follows that X 
and X (and also Y of Theorem II. 2p have the same absorption time T. The discussion 
in [2] is used in that paper in the case that P is ergodic and P is absorbing, but it 
applies equally well to any intertwining. Once we have built the pure birth chain X, the 
independent geometric random variables summing to T are simply the waiting times 
between successive births in X. 

Here is our construction of X; it has the same form (but with A changed to A) as 
in Section 4.1 of [Bj, which applied to a different setting, and we repeat it here for the 
reader's convenience. The chain X starts with Xq = and we set Xq = 0. Inductively, 
we will have A(Xt,Xf) > (and so Xt < Xt) at all times t. The value we construct for 
Xt depends only on the values of Xt-i and Xt and independent randomness. Indeed, 
given X t -\ = x and X t = y, if y < x then our construction sets X t = x + 1 with 
probability 

P(x, x + l)A(g + 1, y) _ (1 - e £ )K(x + l,y) _ (1 - fla)Qa+i(0, y) 

(PA)(x,y) " ^.A(x,y) + (l-^)A(x + l,y) " (Q*P)(0,y) { ' } 

and X t = x with the complementary probability; if y = x + 1 (which is the only other 
possibility, since y = Xt < Xt-\ + 1 < x + 1 by induction), then we set Xt = x + 1 with 
certainty. 

Remark 3.1. (a) By (|3.ip and the lower-triangularity of A, our construction satisfies 
Xt < Xt for all t. Thus, among all discrete-time birth-and-death chains on {0, . . . , d} 
started at and with absorbing state d and given nonnegative eigenvalues < 0q < 9\ < 
■ ■ ■ < 8<i-i < 0d = 1) the pure-birth "spectral" kernel P defined at (jl.ip is stochastically 
maximal at every epoch t. 

(b) In the general setting of Theorem ll.2l we do not know any broad class of examples 
other than the birth-and-death chains we have just treated for which the eigenvalues 
are nonnegative real numbers and the spectral polynomials are nonnegative matrices. 
Nevertheless, the stochastic construction of the preceding paragraph applies verbatim 
to all such chains. 

For continuous-time birth-and-death chains, and more generally for continuous-time 
skip-free chains with real eigenvalues and generator G having nonnegative spectral poly- 
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nomials 

(G + u I)---(G + u k ^I), k = 0,...,d, 

there is an analogous construction. It takes a bit more effort to describe than for 
discrete-time chains, so we refer the reader to Section 5.1 of [j^, where again A there is 
changed to A here, for details. Briefly, if the bivariate chain (X,X) is in state (x,x) at 
a given jump time, then we construct an exponential random variable with rate 

r = Vxh.{x + 1, x)/A(x, x). 

If X jumps before the lifetime of this exponential expires, then X holds unless X jumps 
to x + 1, in which case X also jumps to x + 1. But if the exponential expires first, then 
at that expiration time X holds but X jumps to x + 1. 

Remark 3.2. Among all continuous-time birth-and-death chains on {0, . . . , d} started 
at and with absorbing state d and given eigenvalues vq > v\ > ■ ■ ■ > ua-i > v d = 
for —G. the pure-birth "spectral" chain with birth rate Vi at state i for i = 0, . . . , d is 
stochastically maximal at every epoch t. 

4 Proof of Theorem 11.4( b) 

In this section we prove Theorem II .41 (h)) . The proof is quite similar to that of Theo- 
rem [L2j so we shall be brief. Denote the eigenvalues of P by 9q, • • • > Since we assume 
that P is ergodic, precisely one of these, say 9^, equals 1 and the rest have modulus 
smaller than 1. 

Define the matrices Qk as at (|2,ip [so that (|2.2p holds] and define A as at (|2,3p . Then 
Lemma 12 . 1 1 holds for P of Theorem 11.4( b) by the same proof, as do properties (iv)-(v) 
of Section [TJ and the relation (|2.4p . As we will show next, Lemma 12.31 also holds in 
the present setting, and then the same argument as in the final paragraph of Section [2] 
completes the proof of Theorem 11.4( b). 

Lemma 4.1. Let P and X, with fastest strong stationary time T, be as in Theo- 
rem \1.4]( b) and let P be the bidiagonal matrix defined at (|l.ip . where 9q, ■ ■ ■ ,9^-1 are 
the non-unit eigenvalues of P. Then 

P(T < t) = P\0,d), t = 0,l,.... 

Proof. As in the proof of Lemma 12.31 the key is the intertwining relation 

Ap t = pt A) t = 0,l,..., (4.1) 

together with properties (iv)-(v) of Section [TJ We consider the (0,d) entries in (|4.ip . 
On the left we have 

(AP*)(0, d) = Y^ A(0, k)P t (k, d) = P\0, d) = P(T < t) n(d), (4.2) 

k 

where ir is the stationary distribution for P. We explain how to obtain the final equality. 
By the assumption that the time-reversal (call it P) of P is stochastically monotone, 
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the ratio P l (0, x) /ft(x) = P t (x, 0) /tt(0) is minimized by x = d, with minimum value (by 
definition) 1 — s(t), where s(t) is the so-called separation from stationarity for X. But, 
as is well known, a fastest strong stationary time T for X satisfies 

P(T < t) = 1 - s(t) for all t. 

On the right at (|4.ip we have 

(P*A)(0, d) = P\0, fc)A(fc, d) = P 4 (0, d)A(d, d). (4.3) 

Now equate fj4.2|) and (|4.3p and pass to the limit as t — > oo. On the left, P(T < t) — >■ 1 
(by ergodicity of X); on the right, P t (0,d) — > 1 by the argument presented in the 
final paragraph of the proof of Lemma 12.31 So A(d, d) = n(d), and now once again 
equating (|4.2|) and (|4.3|) and canceling the factor ir(d) > gives the desired result. 

□ 

Stochastic constructions for Theorem 11.41 have already been given for birth-and- 
death chains, in discrete time with nonnegative eigenvalues and in continuous time, 
in Section 4.1 (respectively, Section 5.1) in [6]. The constructions extend verbatim 
to all skip-free chains with real (and, in discrete time, nonnegative) eigenvalues and 
nonnegative spectral polynomials. 

5 General chains 

Our proof in Section [2] of the central Theorem 11.21 for the absorption time of a discrete- 
time skip-free chain rested on the construction of a matrix A having the properties 
(i)-(iii) listed in Section [TJ The question we wish to address in this section are: 

(a) Can Theorem 11.21 be extended to general absorbing chains? 

(b) Is the spectral-polynomials construction of A inevitable? That is, is the matrix A 
uniquely determined by the properties (i)-(iii) listed in Section Q]? 

(c) If the eigenvalues and spectral polynomials of a general absorbing chain are all 
nonnegative, can the stochastic construction for Theorem 11.21 be extended? 

As we shall see, the answer to each of these questions is yes. Question (b) arises 
naturally because (switching to continuous time and limiting attention to birth-and- 
death chains for the moment, since that is the setting of [3]) the proofs of Theorem ll.il 
both of Diaconis and Miclo [3] and of the present paper rely on construction of a link A 
such that AG = GA, where G (the analogue of P in continuous time) is the pure-birth 
"spectral" generator described in our Remark 13.21 The two methods of construction are 
strikingly different, so it is interesting that the end-product A is the same. 

All three questions can be addressed in the following setting generalizing that of The- 
orem [L2J (We shall subsequently call this "the general setting".) Consider a discrete- 
time Markov chain X with state space {0, . . . , d} and arbitrary initial distribution mo 
(regarded as a row vector in later calculations) and transition matrix P. Assume that 
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state d is absorbing and accessible from each other state. (There is no loss of generality 
in restricting the absorbing set to be a singleton.) Let 6q,. . . ,0d-i be the d non-unit 
eigenvalues of P (in fixed but arbitrary order — except where we require the nonnega- 
tivity of spectral polynomials, in which case we use nondecreasing order), let 9^ = 1, 
and let P be defined by (II. ip . All square matrices we consider have rows and columns 
indexed by the state space {0, . . . , d}. Let Qo, ■ ■ ■ ,Qd be the normalized spectral poly- 
nomials defined at (|2.ip . 

Questions (a) and (b) are answered in Section \5. II (see Lemma [5. II and Theorem 15.21 
respectively); question (c), in Section 1531 

5.1 Absorption times for general chains and the inevitability of spec- 
tral polynomials 

Our first result of this subsection demonstrates the inevitable use of spectral polynomials 
in the construction of A, even in the present general setting, provided Ao = rriQ. 

Lemma 5.1. The unique matrix A (with rows denoted by Ao, • • • , A^) satisfying the two 
conditions 

m = A and AP = PA (5.1) 

is given by 

Xi = m Qi, i = 0, . . . ,d. (5.2) 

Proof. It is easy to check, as in the proof of Lemma 12.11 that the choice (|5.2p satis- 
fies (15.1j) . Conversely, the ith row of AP = PA (i = 0, . . . , d — 1) requires 

\ i p = e i x i + (i-e i )\ i+1 , i.e., A i+ i = Xi [(i-e i )- l (p-e i i)] , 

and so (by induction) (15. ip implies (15. 2p . □ 

Our next result (Theorem 15 . 2|> . the main result of this section, greatly generalizes 
Theorem 11.21 by providing a tidy representation of the absorption-time distribution in 
the general setting. For the purposes of this result, we use the conventions that an 
empty sum vanishes and that A(—l,d) := and A(d + l,d) := 1. We also use the 
notation 

a k := A(k,d) - A(k-l,d), k = 0,...,d + l. (5.3) 

Observe that these 's sum to unity, and in the proof of Theorem 15.21 we show that 
a,i+i always vanishes. Moreover, if the spectral polynomials happen to be nonnegative, 
it is easily verified that a := (ao, . . . , a^+i) has real and nonnegative entries and so is 
a probability mass function. Finally, if the eigenvalues 0i are all real and nonnegative, 
then for k = 0, . . . , d we let Q(9q, . . . , 6k-i) denote the convolution of geometric distri- 
butions with respective success probabilities 1 — 9q, . . . , 1 — O^-i] we utilize the natural 
conventions here that this distribution is concentrated at when k = 0. 

Theorem 5.2. In the above general setting and notation, in particular with P defined 
at (jl.ip , A defined as in Lemma [5A\ at (|5.2p . and a& defined at (|5,3p . the absorption 
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time T satisfies 

d d 

P(T<t) =^a fc ]TP'(0,j), i = 0,1,2,..., (5.4) 

k=Q j=fe 
wii/i probability generating function 

d k-1 
fc=0 j=0 

In particular, if the eigenvalues Oi are all nonnegative real numbers and the spectral 
polynomials in P are all nonnegative matrices, then T is distributed as the a-mixture 
Ylt=o ak ^(^0i • • • j ^fc-i) °f the convolution distributions G(Oq, . . . , k = 0, . . . ,d. 

Proof. Lemma 15.11 is the key. Readily generalizing the proof of Lemma 12.31 and then 
using summation by parts, one finds 

P(T < t) = J2P\0,j)A(j,d) = J> fc ]TP'(0, j). 

j=0 k=0 j=k 

As shown in the proof of Lemma [231 P*(0, j) — > 5dj as t — > oo; thus 1 = ^fc=o o-k and 
so a^+i = 0. Equation (|5.4j) is all that is needed to establish (15, 5j) when the eigenvalues 
of P are nonnegative real numbers; in general one can use (|2,6p (we omit the routine 
details). □ 

Remark 5.3. (a) If the chain is upward skip-free and too = o~o, then = 84 k and 
Theorem 11.21 is recovered. 

(b) The following special case is treated in detail by Miclo [TTJ, albeit in somewhat 
different fashion. If there exists ir satisfying the detailed balance condition TViPij = ftjPji 
for all i,j € {0, . . . , d — 1}, then by an argument like the one at the beginning of our 
Section [3] the eigenvalues of P are nonnegative reals and the spectral polynomials are 
nonnegative. Thus, by Theorem 15.21 the absorption time is distributed as a mixture of 
convolutions of geometric distributions. Miclo also shows that > when the states 
in {0, . . . , d — 1} all communicate with one another. 

(c) At least one case other than that of part (b) of this remark is known for which 
the eigenvalues and spectral polynomials are all nonnegative. If P is upper triangular, 
then of course the eigenvalues are nonnegative reals, and He and Zhang [7J Appendix A] 
prove that the spectral polynomials are nonnegative. 

(d) A result analogous to Theorem 15.21 holds for continuous-time chains; we omit 
the details. See Miclo Section 1] for a discussion of connections with the extensive 
literature on so-called "phase-type" distributions. 

(e) A result similar to Theorem 15.21 (with a similar proof) holds for the distribution 
of the fastest strong stationary time of a general ergodic chain X with general initial 
distribution too and stationary distribution ir, provided P(Xt = x)/ir(x) is minimized 
for every t by the choice x = d. A sufficient condition for this is that the time-reversal 



(1 - 6j)u 



(5.5) 
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of P is stochastically monotone (with respect to the natural linear order on {0, . . . , d}) 
and mo(x)/ir(x) is decreasing in x (for example, thq = 5q). The theorem then has the 
same form as Theorem I5.2| except that now needs to be defined as 

[A(k,d) — X(k — l,d)]/iv(d). 



5.2 Stochastic construction via a modified link 

We continue to study the general setting. Whenever the eigenvalues and spectral polyno- 
mials are all nonnegative, the link A of (|5.2[) provides an intertwining of the semigroups 
(P t )t>o and (P*)4>o [recall (|5.ip ], and again (as in Section [3]) a chain X with kernel P 
can be constructed such that Xq = and the "sample-path intertwining" relation (|3.1|) 
holds. However, unlike for skip-free chains, in the general case although we do have 
A(d, d) = 1 (this simply restates our earlier observation that a^+i = 0) there is no 
guarantee that the link A is lower triangular and thus all we can say with certainty is 
that the absorption times T for X and T (say) for X satisfy T <T. We will rectify this 
situation by modifying the link A; this will not contradict the uniqueness of A proven in 
Lemma GTTJ because we will also substitute a different "dual" kernel P for the spectral 
kernel P. 

To set the stage in the general setting without yet imposing any assumptions about 
nonnegativity of eigenvalues or spectral polynomials, define the matrix A with rows 
A , • • • , Xd by setting 



A; 



XiidT^Xi-X^dd] ifAi(d)^l 

if Xi(d) = 1, 



(i = 0,...,d) 



where 5d is the coordinate row vector (0, 0, ... , 0, 1), and note that the rows of A, like 
those of A, sum to unity. Further, define P := B + R to be the sum of the bidiagonal 
upper triangular matrix B and the matrix R with rank at most 1 vanishing in all 
columns except for the last defined by 



ho ■- 



l-X i+1 (d) /-. 
l-Ai(d) V 1 



J) 

and [recalling the notation (|5.3j) ] 

l-X i+1 (d) 



rid ■-- 

if Xi(d) ^ 1, and by 



1 



l-W) 



(1 



if j = i 
i£j = i + l 
otherwise, 

" 1 - Xi(d) 



(1 - 9i 



bij ■ — Ojj, 



Ud ■= 



if Xi(d) = 1. Observe that the rows of P sum to unity. Finally, let mo be the probability 
row vector 

W := (1 - m o (d),0, . . . , 0, m (d)) = (1 - a , 0, . . . , 0, a ). 



The following key fact follows by straightforward calculations from Lemma 15. 11 
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Lemma 5.4. In the general setting and the above notation, 

mo = mo A and AP = P A. 

One can use Lemma [5. 41 in place of Lemma [5. II to give another proof of Theorem l5.21 
But much more is possible when the eigenvalues and spectral polynomials of P are 
all nonnegative, as we shall henceforth assume. In that case we have the following 
conclusions (with all proofs entirely routine): 

• A and P are both stochastic, and so the semigroups (P*)t>o and (P )t>o are 
intertwined by the link A. 

• The set A of absorbing states for a chain X with kernel P satisfies A = {d, . . . , d}, 
where 

d : = min{z : \i(d) = 1} = min{i : m = 0} — 1 G {0, . . . , d}, 
and a% = if and only if % > d + 1. 

• For a P-chain, from each state in {0, . . . ,d — 1} the states d and d are each 
accessible but none of the other states in A is. 

• The construction of Section 2.4 of [2] allows us to build, from X and independent 
randomness, a chain X with initial distribution m and kernel P such that 

C(X t \X ,...,X t ) = A(X t , •) for all t. 

• The time T to absorption in state d for X is the same (sample-pathwise) as the 
time to absorption (call it T) in A (i.e., in {d, d}). 

• Let L be the largest value reached by X prior to absorption, with the convention 
L := —1 if the initial state of X is d. Then P(L = k — 1) = for all k = 0, . . . , d. 
Further, conditionally given L = k — 1, the amounts of time it takes for X to 
move up from to 1, from 1 to 2, ... , from k — 2 to k — 1, and from k — 1 to A 
are independent geometric random variables with respective success probabilities 
1 — 6> , 1 - 0i, . . . , 1 — k _2, 1 — 9k-i- 

Thus in the case of nonnegative eigenvalues and spectral polynomials we have en- 
riched the conclusion of Theorem 15.21 by means of a stochastic construction identifying 
(i) a random variable (namely, L + l) having probability mass function (a&); and, condi- 
tionally given L + l = k, (ii) individual geometric random variables whose distributions 
appear in the convolution G{0q, . . . , 6k-i)- 

Remark 5.5. Recall Remark 15.31 If the conditions described there of detailed balance 
and complete communication within {0, ... ,d — 1} both hold, then d = d and thus 
A = {d} is a singleton. 

Acknowledgments. Section [5] was added after the author read the interesting 
paper [TTJ by Laurent Miclo; we thank him for providing a preprint, which the results 
of Section [5] generalize. We also thank Mark Brown for inspirational discussions. 
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